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Description 

Related Applications 

The subject matter of this application is related to 
the subject matter of the following applications: 
European patent application 961 01 842. 1 ; 
European patent application 96101839.7; 
European patent application 96101840.5; 
European patent application 96101841.3; 
the European patent application entitled "RECLAMA- 
TION OF PROCESSOR RESOURCES IN A DATA 
PROCESSOR"; 

the European patent application entitled "METHOD 
AND APPARATUS FOR SELECTING INSTRUCTIONS 
FROM ONES READY TO EXECUTE"; 
the European patent application entitled "METHOD 
AND APPARATUS FOR ACCELERATING CONTROL 
TRANSFER RETURNS"; 

the European patent application entitled "METHODS 
FOR UPDATING FETCH PROGRAM COUNTER"; 
the European patent application entitled "METHOD 
AND APPARATUS FOR RAPID EXECUTION OF CON- 
TROL TRANSFER INSTRUCTIONS"; 
the European patent application entitled "METHOD 
AND APPARATUS FOR PRIORITIZING AND HAN- 
DLING ERRORS IN A COMPUTER SYSTEM"; 
the European patent application entitled- "METHOD 
AND APPARATUS FOR GENERATING A ZERO BIT 
STATUS FLAG IN A MICROPROCESSOR"; 
and 

the European patent application entitled "ECC PRO- 
TECTED MEMORY ORGANIZATION WITH PIPE- 
LINED READ-MODIFY-WRITE ACCESS", 
the latter eight of which are filed simultaneously with this 
application. 

Background of the Invention 

Technical Field This invention relates to methods 
and systems for handling exceptions in computer sys- 
tems, and, in particular, to methods and systems for 
handling exceptions triggered by unimplemented 
instructions using a combination of hardware and soft- 
ware. 

Related Art It is not always possible or desirable to 
implement in hardware all of the instructions defined in 
the specification of a particular microprocessor archi- 
tecture. For example, some instructions may be 
included in a specification to support software written for 
earlier versions of the microprocessor. Including hard- 
ware to implement these instructions can have a sub- 
stantial impact on the critical path of the processor, and 
when these instructions are used only infrequently, the 
efficiency cost of including the necessary hardware may 
not be justified. In such situations, designers often use 
supervisory software to retrieve, interpret, and emulate 
the unimplemented instructions. 



Conventional methods for software emulation of 
unimplemented instructions employ trap vectors to 
transfer control of program flow from the processor to a 
software routine or emulation code in the supervisory 

5 software. Instructions are specified by various bit fields 
in a corresponding instruction code. Typically, all unim- 
plemented instructions having the same instruction type 
field are processed by a single trap vector, independent 
of the instruction parameters specified in the remaining 

ro bit fields (parameter fields) of the instruction code. For 
example, all unimplemented load instructions use the 
same trap vector to access emulation code, independ- 
ent of the source and destination registers specified by 
the instruction. These parameters are obtained by the 

75 emulation software pointed to by the trap vector. Thus, 
the emulation code must include transfer control func- 
tions such as a branch table to access the unimple- 
mented instruction in memory, as well as routines to 
interpret the fields in the instruction code, and process 

20 the unimplemented instruction using instructions for 
which the processor includes implementing hardware. 

The transfer control, interpretation, and processing 
routines included in the emulation code add a substan- 
tial number of steps to the code, and increase the time 

25 necessary to run the emulation code. For example, 
emulation code suitable for handling retrieving load 
instructions from memory, identifying the source and 
destination registers specified in the instruction, and 
processing the instruction using implemented instruc- 

30 tions may require twenty to forty steps in assembler lan- 
guage, of which the actual processing routine only 
represents seven or eight steps. Thus, there is a need 
for faster software emulation processes that rely on 
fewer code steps to emulate unimplemented instruc- 

35 tions. 

Summary of the Invention 

The present invention is a system and method for 

40 software emulation of unimplemented instructions using 
emulation codes specific to the instruction type and 
parameter fields of the unimplemented instructions. For 
each unimplemented instruction, an emulation code 
specific to the instruction type and parameter fields of 

45 the corresponding instruction code is prepared and 
stored at a memory address related to these fields. A 
processor includes issue trap logic to read the informa- 
tion type and parameter fields when the unimplemented 
instruction triggers an exception. The issue trap logic 

so then calculates the memory address of the correspond- 
ing emulation code from the read fields and transfers 
control directly to this emulation code. The issue trap 
logic also transfers selected parameter fields of the 
unimplemented instruction to registers in the processor. 

55 where the data is available for processing by the emula- 
tion code. 

Because the issue trap logic reads the information 
type and parameter fields of an unimplemented instruc- 
tion as part of the exception handling process, the emu- 
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lation code need not contain routines for retrieving the 
unimplemented instruction from memory and interpret- 
ing the parameters fields from the retrieved instruction. 
Instead, the emulation code is tailored to parameter and 
instruction type fields of the corresponding unimple- 5 
merited instruction and consequently runs faster. By 
storing the emulation codes at memory addresses 
related to the fields read by the issue trap logic, the logic 
can branch directly to the appropriate emulation when 
the exception is triggered. 10 

In accordance with the present invention, the issue 
trap logic identifies unimplemented instructions and cal- 
culates a memory address for a trap vector in an emula- 
tion code table using an offset determined by the 
instruction type and parameter fields. The fields are left is 
shifted to ensure that the trap vectors are sufficiently 
s parated to accommodate complete emulation code 
sequences within the emulation code table itself. The 
processor branches directly to the trap vector which 
forms the first instruction in an emulation code 20 
sequence specific to the parameter and instruction type 
fields of the unimplemented instruction. The operand 
information necessary to process the unimplemented 
instruction is read from the processor register by the 
emulation code. 25 

Brief Description of the Drawings 

Fig. 1 is a schematic representation of a prior art 
method for processing unimplemented instructions. 30 

Fig. 2 is a schematic representation of a method for 
processing unimplemented instructions in accordance 
with the present invention. 

Fig. 3 is a schematic representation of the method 
for generating an address in the emulation code table 35 
from an unimplemented instruction. 

Fig. 4 is a flow chart of a method for software emu- 
lation of unimplemented instructions in accordance with 
the present invention. 

Fig. 5 is a schematic diagram of the logic used to 40 
generate the address of Fig. 3 from the fields of an 
unimplemented instruction. 

Detailed Description of the Invention 

45 

Referring to Fig. 1, there is shown a schematic rep- 
resentation of a conventional system 100 for handling 
unimplemented instructions. System 100 comprises a 
processor 110 and a memory 1 50 coupled to processor 
110. Processor 110 includes issue trap logic 130 and so 
various processor registers 162, 164, 166 for trap han- 
dling, and memory 150 includes a trap table 140 and 
supervisory software 160 for trap handling. A program 
counter register (PC) 120 in processor 110 is shown 
holding an unimplemented instruction i16 from an 55 
instruction stream 122, which also includes imple- 
mented instructions 112, 114, 118, i.e. instructions for 
which processor 110 includes implementing logic. 



A base address 142 specifies the location of trap 
table 140 in memory 150 and a trap vector 144 corre- 
sponding to the instruction type of unimplemented 
instruction 116 is at an offset 143 determined by the 
instruction type of unimplemented instruction 116. As 
shown, trap vector 144 points to an emulation code 152 
in supervisor software 160. Emulation code 152 is a 
general emulation code suitable for all unimplemented 
instructions 216 of a given instruction type. Accordingly, 
emulation code 152 includes routines for retrieving 
unimplemented instruction 216 of the given instruction 
type, interpreting the parameter fields that fully specify 
unimplemented instruction 216, and emulating the fully 
specified unimplemented instruction 216. 

In order to access emulation code 142, base 
address 142 is stored in processor register 162 and off- 
set 143 is calculated by trap logic 130 from the contents 
of processor registers 164, 166. For example, in proces- 
sors employing SPARC 64-bit architecture (V. 9), regis- 
ter 162 is a trap base address (TBA) register 162 that 
holds 49-bit table base address 1 42. Offset 143 is deter- 
mined by a one-bit trap level (TL) register 164 and a 
nine-bit trap type (TT) register 166 that is determined by 
the type of instruction causing the exception. The 
remainder of the 64-bit address of trap vector 144 is 
made up of zeroes. TBA register 162 is set by supervi- 
sor software at initialization. TL and TT registers 164, 
166, respectively, are loaded by issue logic 130 when 
an unimplemented instruction 116 triggers an excep- 
tion. 

Referring now to Fig. 2, there is shown a schematic 
representation of a system 200 for hardware-assisted 
software emulation of unimplemented instructions in 
accordance with the present invention. System 200 
comprises a processor 210 and a memory 250 associ- 
ated with processor 210 that includes an emulation 
code table 240. Process 210 includes a PC 220, issue 
trap logic 230, and trap handling registers 262, 264, 
266, 268. An instruction stream 222 is shown including 
implemented instructions 212, 214, 218 and unimple- 
mented instruction 216. 

Emulation code table 240, which may be formed by 
expanding trap table 140 (Fig. 1), comprises a base 
address 242 and a trap vector 244 corresponding to 
unimplemented instruction 216 at an offset 243 with 
respect to base address 242. Trap vector 244 forms the 
first step of an emulation code 252 tailored to the 
instruction type and parameter fields of unimplemented 
instruction 216, and is spaced sufficiently from other 
trap vectors (not shown) to accommodate all steps of 
emulation code 252. 

Because emulation code 252 is tailored to the 
instruction type and parameter fields of unimplemented 
instruction 21 6, it does not include routines for retrieving 
and interpreting unimplemented instruction 216. 
Instead, emulation code 252 is coupled directly to unim- 
plemented instruction 216 by offset 243, which is 
related to the instruction type and parameter fields of 
unimplemented instruction 216. When an exception is 
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triggered, issue trap logic 230 reads instruction type 
and parameter f ields in unimplemented instruction 216 
to determine offset 243 and passes control directly to 
emulation code 152. Upon completion of emulation 
code 252, control of processing is returned to instruc- 
tion 218 which follows unimplemented instruction 216. 

The method for emulating unimplemented instruc- 
tions will now be discussed with respect to the LDD and 
STD instructions defined in the SPARC architecture, 
versions 8 and 9. as discussed for example in D.L 
Weaver, T. Germond, The SPARC Architectural Man- 
ual", (1994) Prentice-Hall. The LDD and STD instruc- 
tions are deprecated in version 9 but must be supported 
in some manner in order to run software written for ver- 
sion 8. It should be understood that a person skilled in 
the art will recognize that the method is generally appli- 
cable and is not restricted to the specific processor 
architecture and instruction types discussed below. 

Referring now to Fig. 3, there is shown an instruc- 
tion code 310 in a format suitable for LDD (load double 
word) and STD (store double word) instructions. 
Instruction code 310 contains instruction type (opcode) 
fields 312, 314 and parameter fields 316, 318, 320. In 
the disclosed embodiment, parameter field 31 6 is a des- 
tination register field which indicates the register into 
which a value is to be loaded or from which a value is to 
be stored. Parameter fields 318, 320 are source register 
and immediate fields, respectively, which are combined 
to indicate the memory address to be loaded or stored. 
An address bit 322 is set in LDD and STD instruction to 
indicate that the memory address to be loaded or stored 
is determined by the sum of parameter fields 318, 320, 
i.e. source register and immediate fields, respectively 
(immediate addressing mode). 

Also shown in Fig. 3 is a 64 bit binary 330 that is 
generated by the method of the present invention to 
access emulation code 252 at trap vector 244. That is. 
binary 330 is the memory address at which emulation 
code 252 for the instruction type and parameter fields of 
unimplemented instruction 216 (Fig. 2) is accessed. 
Binary 330 comprises an emulation table base address 
field 340. an offset field 350, and a zero field 360. In the 
disclosed embodiment of the present invention, issue 
trap logic 230 of Fig. 2 retrieves a 48-bit base address 
242 for emulation table 240 from TBA register 262 of 
processor 210 and writes base address 242 to base 
address field 340. Issue trap logic 230 retrieves param- 
eter fields 316, 318 (destination and source register 
fields, respectively) from instruction code 310 and 
writes them to offset field 350, as indicated by dashed 
lines between instruction code 310 and binary 330. In 
the disclosed embodiment, parameter fields 316, 318 
are 5-bit fields, with the upper 4-bits of parameter field 
316 combined with the 5-bits of parameter field 318 to 
form offset 243 (Fig. 2). Issue trap logic 230 also 
retrieves immediate field 320 and writes it to a software 
accessible register 270 (Fig. 2) where it is available to 
emulation code 252 for processing unimplemented 
instruction 216. For example, in a SPARC based proc- 



essor, software accessible register 270 may be pro- 
vided by one of the ancillary state registers (%ASR[ r q) 
supported in SPARC architecture. Additional bits not 
shown in Fig. 3 contribute to offset 243 and are dis- 
cussed below in conjunction with issue trap logic 230. 

Referring now to Fig. 4, there is shown a flow chart 
summarizing method 400 of the present invention. An 
emulation code specific to the instruction type fields and 
selected parameter fields of an unimplemented instruc- 
ted tion is generated 410. The emulation code is then 
stored. 420 at a memory address that is related to the 
instruction type and parameter fields of its correspond- 
ing unimplemented instruction. When an unimple- 
mented instruction exception is detected 430, issue trap 
is logic 230 (Fig. 2) determines 440 the memory address 
of the corresponding emulation code from the instruc- 
tion type fields and selected parameter fields of the 
unimplemented instruction that triggered the exception. 
Method 400 then branches 450 to the determined mem- 
20 ory address, transferring control to the emulation code 
that has been tailored to the unimplemented instruction 
as specified by its instruction type fields and selected 
parameter fields. 

Referring now to Fig. 5, there is shown a represen- 
ts tation of issue trap logic 230 for implementing the emu- 
lation scheme of the present invention in the case of 
LDD and STD instructions defined in the SPARC Archi- 
tecture. Logic unit 51 0 represents the logic gates neces- 
sary to identify LDD and STD instructions in processor 
30 210 (Fig. 2). For example, LDD and STD instructions 
are 32 bit instructions having specific opcodes and 
employing immediate addressing mode. Accordingly, 
the address mask (AM) field of the PSTATE register in 
processor 210 is set (PSTATE. AM « 1) to mask the 
35 upper 32 bits for processing, the /-field of LDD and STD 
instruction codes is set to indicate immediate address- 
ing (i == 1), and the opcodes. Finally, bit 23 (instruction 
type field) of instruction codes 310 distinguishes 
instructions LDD and STD from their alternate space 
40 counterparts LDDA and STDA, respectively. These 
three conditions are indicated schematically within logic 
unit 510. 

A control line 520 from logic unit 510 sets bit 15 of 
binary 330 (Fig. 3) directly and sets bit 14 and bits 13-5 
45 (corresponding to offset 243 of Fig. 2) through a pair of 
multiplexers 530, 540, respectively. As indicated earlier, 
the upper 48 bits of binary 330 are set directly by TBA 
register 162 and the lower 5 bits are zeroed. 

In order to write offset 243 to bits 14-5 when an 
so unimplemented instruction exception is triggered by an 
LDD or STD instruction, control line 520 is connected to 
select inputs 532, 542 of multiplexers 530, 540, respec- 
tively. A first data input 534 couples bit 21 (S) in the 
instruction type f ield of the LDD (S = 0) and STD (S = 1) 
55 instruction codes 310 to multiplexer 530, and second 
data input 536 couples TL register 164 of processor 210 
to multiplexer 530. Similarly, a first data input 544 cou- 
ples the sum of parameter fields 316 and 318 to multi- 
plexer 540, and a second date input 546 couples TT 
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register 166 to multiplexer 540. Control line 520 then 
couples inputs 534, 544 to bits 14-5 when set by logic 
unit 510 (LDD or STD detected) to form an offset 243 
into emulation code table 240 pointing to emulation 
code 1 52, which is specific to parameter fields 31 6, 31 8 5 
of LDD (STD) instructions. When control line 520 is 
reset by logic unit 510, inputs 536, 546 are coupled to 
bits 14-5 of binary 330, providing access to trap vectors 
for conventional exception handling. 

Referring now to Table 1 , there is shown an emula- 10 
Won code suitable for emulating LDD instructions speci- 
fying any source and destination registers and any 
immediate operand. 



Table 1 


LDD: 


rd 


%asr25. %g1 


add 


rs1,%g1,%g1 


srl 


%g1,0,%g1 


Idxa 


[%g1]ASI_ASJFjJSER, rd+1 


sllx 


rd+1,32, rd 


srl 


rd+1,0, rd+1 


done 




nop 





Referring now to Table 2, there is shown an emula- 
tion code sequence suitable for emulating an STD 
instruction specifying any source and destination regis- 35 
ters and any immediate operand. 



Table 2 


STD: 


rd 


%asr25, %g1 


srl 


rd+1,0, %g2 


sllx 


rd, 32, %g3 


or 


%g2, %g3, %g2 


add 


rs1,%g1.%g1 


srl 


%g1,0.%g1 


stxa 


%g2, [%g1]ASLASJFJJSER 


done 





Therefore, a system and method have been pre- 
sented for using hardware to support fast software emu- 
lation of unimplemented instructions. Emulation codes, 
which form the software component of the invention, are 



tailored to the instruction type and parameter fields of 
the unimplemented instructions and stored at memory 
addresses determined from these fields. The resulting 
reduction in the size of the emulation code speeds up 
the software emulation process. The issue trap logic, 
which forms the hardware component of the invention, 
is expanded to read these fields when an exception is 
triggered to determine the memory address of the 
appropriate emulation code and branch directly to this 
emulation code. 

Claims 

1 . A method for emulating an unimplemented instruc- 
tion specified by instruction-type and parameter 
fields in an instruction code, the method comprising 
the steps: 

generating a code sequence tor emulating 
the unimplemented instruction specified by the 
instruction-type and parameter fields; 

storing the code sequence at a memory 
address that is related to the instruction-type and 
parameter fields of the unimplemented instruction; 

when the unimplemented instruction is 
detected, determining the memory address of the 
code sequence from the instruction-type and 
parameter fields; and 

transferring control to the memory address 
of the code sequence to emulate the unimple- 
mented instruction. 

2. The method of claim 1 , further comprising the steps 
of: 

identifying an operand field of the unimple- 
mented instruction; and 

writing a value specified in the identified 
operand field to a register accessible to the code 
sequence. 

3. The method of claim 1 , wherein in the step of stor- 
ing the code sequence comprises the substeps of: 

forming an emulation code table having a 
base address in memory; 

forming an offset from the base address with 
an parameter field of the unimplemented instruc- 
tion: 

storing the code sequence at a memory 
address that comprises the base address and the 
offset. 

4. The method of claim 3, wherein the step of forming 
the offset comprises adding a value in a first param- 
eter field of the instruction code to a value in a sec- 
ond parameter field in the instruction code. 

5. A method for emulating an unimplemented instruc- 
tion specified by instruction-type and parameter 
fields in a system comprising a processor, a mem- 
ory coupled to the processor and indexed by mem- 
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ory addresses, and a code sequence stored in the 
memory that is suitable for emulating the unimpie- 
mented instruction . the method comprising the 
steps of: 

storing the code sequence at a memory 
address that is related to the instruction-type and 
parameter fields of the unimplemented instruction; 

when the unimplemented instruction is 
detected by the processor, retrieving the instruction 
type and parameter fields to determine the memory 
address of the code sequence; and 

transferring control to the code sequence at 
the determined memory address to emulate the 
unimplemented instruction. 

6. The method of claim 10, wherein the storing step 
comprises the substeps of: 

determining a base address for an emulation 
code table; and 

combining the base address with an offset 
determined by the instruction-type and parameter 
fields of the unimplemented instruction to form the 
memory address of the emulation code. 

7. The method of claim 5, wherein the step of retriev- 
ing the instruction-type and parameter fields to 
determine the memory address comprises the sub- 
steps of: 

retrieving the base address for the emulation 
code table from a selected register in the proces- 
sor; 

determining an offset from the instruction- 
type and parameter fields of the detected instruc- 
tion; and 

combining the offset and the base address 
to form the memory address of the code sequence. 

8. The method of claim 5, comprising the additional 
step of: 

reading an operand field from the detected 
unimplemented instruction; and 

writing the unimplemented instruction to a 
selected register which can be accessed by the 
code sequence. 

9. The method of claim 8, wherein the transferring 
step comprises the substeps of: 

passing control of the system to the code 
sequence; 

accessing the operand field value from the 
selected register as needed to emulate the unim- 
plemented instruction; and 

returning control to an instruction following 
the unimplemented instruction in an instruction 
queue. 

1 0. A system for implementing software emulation of an 
unimplemented instruction specified by instruction- 
type and parameter fields, the system comprising: 



a memory including a code sequence at a 
selected address for emulating the unimplemented 
instruction specified by the instruction type and 
parameter fields; and 
5 a processor including a first register for stor- 

ing a base address for the code sequence, a sec- 
ond register, and issue trap logic, the issue trap 
logic comprising: 

means for detecting the unimplemented 
io instruction; 

means for reading the instruction-type and 
parameter fields of the unimplemented instruction; 
and 

means for determining the selected address 
75 of the code sequence from the read instruction-type 
and parameter fields; and 

means for transferring control of the proces- 
sor to the code sequence at the selected address. 
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410 

GENERATE EMULATION CODE 
SPECIFIC TO INSTRUCTION TYPE & 
PARAMETER FIELDS 



420 

STORE EMULATION CODE AT 
ADDRESS RELATED TO 
INSTRUCTION TYPE & PARAMETER 
FIELDS 



430 

DETECT UNIMPLEMENTED 
INSTRUCTION 



440 

DETERMINE EMULATION CODE 
ADDRESS FROM INSTRUCTION 
TYPE & PARAMETER FIELDS OF 
D ETECTED UNIMPLEMENTED 
INSTRUCTION 



450 

BRANCH TO DETERMINED 
ADDRESS 



FIG. 4 
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